Detecting and Statistically Correcting Sample Selection Bias

نویسندگان

  • Gary Cuddeback
  • Elizabeth Wilson
  • John G. Orme
  • Terri Combs-Orme
چکیده

Researchers seldom realize 100% participation for any research study. If participants and non-participants are systematically different, substantive results may be biased in unknown ways, and external or internal validity may be compromised. Typically social work reGary Cuddeback, MSW, MPH, is Research Associate, Cecil G. Sheps Center for Health Services Research, University of North Carolina at Chapel Hill, 101 Connor Drive, Suite 302, Chapel Hill, NC 27514. Elizabeth Wilson, MSW, is a Doctoral Student, John G. Orme, PhD, is Professor, and Terri Combs-Orme, PhD, is Associate Professor, University of Tennessee, College of Social Work, Children’s Mental Health Services Research Center, Henson Hall, Knoxville, TN 37996-3332. Address correspondence to: Gary S. Cuddeback, Cecil G. Sheps Center for Health Services Research, University of North Carolina at Chapel Hill, 101 Connor Drive, Suite 302, Chapel Hill, NC 27514 or John G. Orme, University of Tennessee, College of Social Work, Children’s Mental Health Services Research Center, 128 Henson Hall, Knoxville, TN 37996-3332 (E-mail: Gary S. Cuddeback at [email protected]. unc.edu or John G. Orme at [email protected]). The authors would like to express their appreciation to Shenyang Guo for his suggestions on an earlier version of this paper and to Niki Le Prohn at Casey Family Programs for her support. This article is the result of a collaborative effort of the Casey Family Programs Foster Family Project at The University of Tennessee and Casey Family Programs, an operating foundation delivering foster care, adoption, and other permanency planning services in 14 states. You may review more information about Casey Family Programs at http://www.casey.research.org/research and more information about Casey Family Programs Foster Family Project at The University of Tennesee at http://www. utcmhsrc.csw.utk.edu/caseyproject/. Journal of Social Service Research, Vol. 30(3) 2004 http://www.haworthpress.com/web/JSSR  2004 by The Haworth Press, Inc. All rights reserved. Digital Object Identifier: 10.1300/J079v30n03_02 19 searchers use bivariate tests to detect selection bias (e.g., χ to compare the race of participants and non-participants). Occasionally multiple regression methods are used (e.g., logistic regression with participation/ non-participation as the dependent variable). Neither of these methods can be used to correct substantive results for selection bias. Sample selection models are a well-developed class of econometric models that can be used to detect and correct for selection bias, but these are rarely used in social work research. Sample selection models can help further social work research by providing researchers with methods of detecting and correcting sample selection bias. [Article copies available for a fee from The Haworth Document Delivery Service: 1-800-HAWORTH. E-mail address: Website: © 2004 by The Haworth Press, Inc. All rights reserved.]

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Outcomes from stroke rehabilitation in Veterans Affairs rehabilitation units: detecting and correcting for selection bias.

This paper addresses the issue of statistical selection bias in multivariate models of functional gain estimated from observational data. Stroke patients from 20 high-volume Veterans Affairs Medical Centers (VAMCs) with acute and subacute inpatient rehabilitation treatment units were observed. Their gains in overall, motor, and cognitive functional status were measured with the use of the Funct...

متن کامل

Correcting sample selection bias in maximum entropy density estimation

We study the problem of maximum entropy density estimation in the presence of known sample selection bias. We propose three bias correction approaches. The first one takes advantage of unbiased sufficient statistics which can be obtained from biased samples. The second one estimates the biased distribution and then factors the bias out. The third one approximates the second by only using sample...

متن کامل

Correcting sample selection in FARS data to estimate seatbelt use.

INTRODUCTION In this paper, we show that FARS data can be a comparable alternative to observational NOPUS data in estimating seat belt use in the United States once we correct for sample selection bias. RESULTS Based on assumptions of independence for seatbelt choice, we establish a lower and upper bound for seatbelt usage rates, and find that once we correct for sample selection bias, the se...

متن کامل

Correcting Sample Selection Bias by Unlabeled Data

We consider the scenario where training and test data are drawn from different distributions, commonly referred to as sample selection bias. Most algorithms for this setting try to first recover sampling distributions and then make appropriate corrections based on the distribution estimate. We present a nonparametric method which directly produces resampling weights without distribution estimat...

متن کامل

Correcting for Survey Nonresponse Using Variable Response Propensity

All surveys with less than full response potentially suffer from nonresponse bias. Poststratification weights can only correct for selection into the sample based on observables whose distribution is known in the population. Variables such as gender, race, income, and region satisfy this requirement because they are available from the U.S. Census Bureau, but poststratification based on these va...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004